CHIL – IP 506909 – Computers in the Human Interaction Loop: Speaker Localization and Tracking - Evaluation Criteria
Authors
Abstract
Version: 5.0, January 18th, 2005, © CHIL ITC-irst
Project: CHIL – IP506909 – Computers in the Human Interaction Loop
Title: Speaker Localization and Tracking Evaluation Criteria
Workpackage: WP4 Speaker Localization and Tracking – Evaluation Criteria
Classification: Draft
Dissemination level: CC: Confidential to the CHIL Consortium
Version: 5.0
Date: January 18, 2005
Number of pages: 26
Document ID: CHIL-IRST_SpeakerLocEval –V5.0-2005-01-18-CC
Partners: ITC-irst
Authors: Maurizio Omologo, Alessio Brutti, Piergiorgio Svaizer
Contributors: See above + Luca Cristoforetti, Paolo Coletti
Final editing: Maurizio Omologo
Synopsis: Evaluation Criteria for the Speaker Localization and Tracking Task
Similar resources
First Experiments of Automatic Speech Activity Detection, Source Localization and Speech Recognition in the Chil Project
In the workspace of the future, a so-called “ambient intelligence” will be realized through the widespread use of sensors (e.g., cameras, microphones, directed audio devices) connected to computers that are unobtrusive to their human users. Towards this end of ubiquitous computing, technological advances in multi-channel acoustic analysis are needed in order to solve several basic problems, inc...
CHIL - Computers in the Human Interaction Loop
CHIL ("Computers in the Human Interaction Loop") is an Integrated Project under the European Commission's Sixth Framework Programme. The CHIL consortium is jointly coordinated by Universität Karlsruhe (TH) and the Fraunhofer Institute IITB. CHIL was launched on January 1st, 2004. The objective of this project is to explore and create environments in which computers serve humans who focus on int...
Active Speaker Localisation and Tracking using Audio and Video
This thesis is concerned with the problem of tracking active speakers using audio and video data. Particular focus is placed on the task of tracking the current active speaker in a lecture room environment using multiple cameras and multiple microphones. A database of lecture recordings corresponding to this scenario from the European Integrated Project, Computers in the Human Interaction Loop ...
A Generative Approach to Audio-Visual Person Tracking
The audio-based speaker localization and tracking task addressed in CHIL is rather challenging. Since the evaluation data have been collected during real seminars and meetings, they present some critical aspects to the localization process. First of all, seminar and meeting rooms are typically characterized by a high reverberation time (for example in the ITC-irst CHIL room the reverberation tim...
Speaker Tracking in Seminars by Human Body Detection
This paper presents evaluation results of a method for tracking speakers in seminars from multiple cameras. First, 2D human tracking and detection is done for each view. Then, 2D locations are converted to 3D based on the calibration parameters. Finally, cues from multiple cameras are integrated in an incremental way to refine the trajectories. We have developed two multi-view integration method...